# Chart Understanding Optimization
Webssl Mae700m Full2b 224
This is a 700M-parameter Vision Transformer model trained on 2 billion web images using masked autoencoder self-supervised learning, without language supervision.
Image Classification
Transformers

W
facebook
15
0
Moondream2
Apache-2.0
Moondream is a lightweight vision-language model designed for efficient operation across all platforms.
Image-to-Text
M
vikhyatk
184.93k
1,120
Featured Recommended AI Models